AITopics | national library

Collaborating Authors

national library

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Swedish Whispers; Leveraging a Massive Speech Corpus for Swedish Speech Recognition

Vesterbacka, Leonora, Rekathati, Faton, Kurtz, Robin, Sikora, Justyna, Toftgård, Agnes

arXiv.org Artificial IntelligenceAug-15-2025

This work presents a suite of fine-tuned Whisper models for Swedish, trained on a dataset of unprecedented size and variability for this mid-resourced language. As languages of smaller sizes are often underrepresented in multilingual training datasets, substantial improvements in performance can be achieved by fine-tuning existing multilingual models, as shown in this work. This work reports an overall improvement across model sizes compared to OpenAI's Whisper evaluated on Swedish. Most notably, we report an average 47% reduction in WER comparing our best performing model to OpenAI's whisper-large-v3, in evaluations across FLEURS, Common V oice, and NST.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2505.17538

Country:

Europe (0.48)
North America > United States (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.49)

Add feedback

Visual Navigation of Digital Libraries: Retrieval and Classification of Images in the National Library of Norway's Digitised Book Collection

Roald, Marie, Birkenes, Magnus Breder, Johnsen, Lars Gunnarsønn Bagøien

arXiv.org Artificial IntelligenceOct-19-2024

Digital tools for text analysis have long been essential for the searchability and accessibility of digitised library collections. Recent computer vision advances have introduced similar capabilities for visual materials, with deep learning-based embeddings showing promise for analysing visual heritage. Given that many books feature visuals in addition to text, taking advantage of these breakthroughs is critical to making library collections open and accessible. In this work, we present a proof-of-concept image search application for exploring images in the National Library of Norway's pre-1900 books, comparing Vision Transformer (ViT), Contrastive Language-Image Pre-training (CLIP), and Sigmoid loss for Language-Image Pre-training (SigLIP) embeddings for image retrieval and classification. Our results show that the application performs well for exact image retrieval, with SigLIP embeddings slightly outperforming CLIP and ViT in both retrieval and classification tasks. Additionally, SigLIP-based image classification can aid in cleaning image datasets from a digitisation pipeline.

artificial intelligence, image retrieval, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2410.14969

Country:

Europe > Norway (0.61)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Austria > Vienna (0.14)
(9 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain

García-Ferrero, Iker, Agerri, Rodrigo, Salazar, Aitziber Atutxa, Cabrio, Elena, de la Iglesia, Iker, Lavelli, Alberto, Magnini, Bernardo, Molinet, Benjamin, Ramirez-Romero, Johana, Rigau, German, Villa-Gonzalez, Jose Maria, Villata, Serena, Zaninello, Andrea

arXiv.org Artificial IntelligenceApr-11-2024

Research on language technology for the development of medical applications is currently a hot topic in Natural Language Understanding and Generation. Thus, a number of large language models (LLMs) have recently been adapted to the medical domain, so that they can be used as a tool for mediating in human-AI interaction. While these LLMs display competitive performance on automated medical texts benchmarks, they have been pre-trained and evaluated with a focus on a single language (English mostly). This is particularly true of text-to-text models, which typically require large amounts of domain-specific pre-training data, often not easily accessible for many languages. In this paper, we address these shortcomings by compiling, to the best of our knowledge, the largest multilingual corpus for the medical domain in four languages, namely English, French, Italian and Spanish. This new corpus has been used to train Medical mT5, the first open-source text-to-text multilingual model for the medical domain. Additionally, we present two new evaluation benchmarks for all four languages with the aim of facilitating multilingual research in this domain. A comprehensive evaluation shows that Medical mT5 outperforms both encoders and similarly sized text-to-text models for the Spanish, French, and Italian benchmarks, while being competitive with current state-of-the-art LLMs in English.

dataset, medical domain, medical mt5, (16 more...)

arXiv.org Artificial Intelligence

2404.07613

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)
(15 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Boosting Norwegian Automatic Speech Recognition

de la Rosa, Javier, Braaten, Rolv-Arild, Kummervold, Per Egil, Wetjen, Freddy, Brygfjeld, Svein Arne

arXiv.org Artificial IntelligenceJul-4-2023

In this paper, we present several baselines for automatic speech recognition (ASR) models for the two official written languages in Norway: Bokm{\aa}l and Nynorsk. We compare the performance of models of varying sizes and pre-training approaches on multiple Norwegian speech datasets. Additionally, we measure the performance of these models against previous state-of-the-art ASR models, as well as on out-of-domain datasets. We improve the state of the art on the Norwegian Parliamentary Speech Corpus (NPSC) from a word error rate (WER) of 17.10\% to 7.60\%, with models achieving 5.81\% for Bokm{\aa}l and 11.54\% for Nynorsk. We also discuss the challenges and potential solutions for further improving ASR models for Norwegian.

artificial intelligence, machine learning, speech recognition, (15 more...)

arXiv.org Artificial Intelligence

2307.01672

Country:

Europe > Norway > Eastern Norway > Oslo (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Sweden > Östergötland County > Linköping (0.04)
(3 more...)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Artificial intelligence uncovers lost work by titan of Spain's 'Golden Age'

The GuardianFeb-5-2023, 08:00:16 GMT

Lost or misattributed works by some of the finest writers of Spain's Golden Age could be discovered thanks to pioneering AI technology that has been used to identify a previously unknown play by the wildly prolific dramatist, poet, sailor and priest Lope de Vega. This week Spain's National Library announced that researchers trawling its massive archive had stumbled upon and verified a play that Lope is believed to have written a few years before his death in 1635. Like many plays of the Spanish Golden Age – the 16th- and 17th-century cultural boom that accompanied Spain's imperial growth and which birthed masterpieces by Lope, Cervantes, Calderón and Velázquez, among many others – La francesa Laura (The Frenchwoman Laura) is a tale of love, jealousy and social hierarchy in which suspicion demands an innocent woman be sacrificed on the altar of her husband's honour. But, unlike many similar plays of the period, Laura survives and the third act ends happily. Equally unusual was the manner of the play's discovery.

francesa laura, golden age, spain, (13 more...)

The Guardian

Country:

Europe > Spain (1.00)
Europe > France (0.06)
Europe > United Kingdom > England (0.05)
Europe > Austria > Vienna (0.05)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Emerging Trends in AI, Ultrasound and OB/GYN Care

#artificialintelligenceSep-22-2022, 16:08:27 GMT

From entertainment to commerce, artificial intelligence (AI) is making a difference in many aspects of life and has the power to advance diagnostics. Next-generation healthcare technology has begun implementing many AI-powered tools to improve efficacy and patient safety, and enhance the clinician experience.1 There are several image acquisition and analysis capabilities that can be enhanced by an AI application for each task.2 Nearly every woman requires an ultrasound at some point during their care. There is huge potential for AI to assist in repetitive tasks and provide promising workload-changing advances with the use of ultrasound in obstetrics and gynecologic (OB/GYN) care.2

application, artificial intelligence, ultrasound and ob gyn care, (13 more...)

#artificialintelligence

Country: Europe > Switzerland > Vaud > Lausanne (0.05)

Genre:

Research Report (0.51)
Overview (0.40)

Industry: Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (0.90)

Technology: Information Technology > Artificial Intelligence > Applied AI (0.31)

Add feedback

The Norwegian Parliamentary Speech Corpus

Solberg, Per Erik, Ortiz, Pablo

arXiv.org Artificial IntelligenceJan-26-2022

The Norwegian Parliamentary Speech Corpus (NPSC) is a speech dataset with recordings of meetings from Stortinget, the Norwegian parliament. It is the first, publicly available dataset containing unscripted, Norwegian speech designed for training of automatic speech recognition (ASR) systems. The recordings are manually transcribed and annotated with language codes and speakers, and there are detailed metadata about the speakers. The transcriptions exist in both normalized and non-normalized form, and non-standardized words are explicitly marked and annotated with standardized equivalents. To test the usefulness of this dataset, we have compared an ASR system trained on the NPSC with a baseline system trained on only manuscript-read speech. These systems were tested on an independent dataset containing spontaneous, dialectal speech. The NPSC-trained system performed significantly better, with a 22.9% relative improvement in word error rate (WER). Moreover, training on the NPSC is shown to have a "democratizing" effect in terms of dialects, as improvements are generally larger for dialects with higher WER from the baseline system.

artificial intelligence, dataset, speech recognition, (14 more...)

arXiv.org Artificial Intelligence

2201.10881

Country:

Europe > Norway > Eastern Norway > Oslo (0.05)
Europe > Norway > Western Norway > Vestland > Sogn og Fjordane (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)

Add feedback

How machine learning is bringing National Library of Scotland's maps to life

#artificialintelligenceOct-2-2020, 22:05:41 GMT

What if machine learning meant that you didn't have to have a definitive starting point and the reams of records in the archives could be explored and enjoyed visually? That is the vision of Martin Disley who has been creating datasets from across the National Library of Scotland's (NLS) map collection. His project, which is part of the Creative Informatics Resident Entrepreneur project at the University of Edinburgh, curated datasets of images previously scanned by the NLS to feed a machine learning model.The newly-created machine learning model then creates'fake' versions of the images that it is trained upon. The generated output from this process can be animated to produce visions of machines dreaming, in this case the fake maps animated and brought to life. This has the effect of synthesising these large collections down in short videos.

artificial intelligence, machine learning, national library, (9 more...)

#artificialintelligence

Country: Europe > United Kingdom > Scotland (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Fantastic Futures 2019 Conference

#artificialintelligenceOct-18-2019, 16:18:18 GMT

Stanford Libraries will host the 2nd International Conference on AI for Libraries, Archives, and Museums over three days, December 4, 5 & 6, 2019. The first'Fantastic Futures' conference, which took place in December 2018 at the National Library of Norway in Oslo, initiated a community-focused approach to addressing the challenges and possibilities for libraries, archives, and museums in the era of artificial intelligence. The Stanford conference will expand that charge, adding to the plenary gathering a full day of workshops and a half day'unconference' shaped by the interests of those assembled. Wednesday, December 4, will be a day of plenary sessions to introduce attendees to a range of topics in AI, from the concerns of algorithmic bias and data privacy to the exciting developments in transforming discovery and digital content curation (see the full program). The two keynote addresses reflect Stanford Library's position as an academic center in close proximity to Silicon Valley: Bryan Catanzaro, the Vice President of Applied Deep Learning at Nvidia, will speak to the important contribution he thinks libraries can make in AI.

fantastic futures 2019, library, national library, (13 more...)

#artificialintelligence

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.40)
Europe > Norway > Eastern Norway > Oslo (0.25)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.25)
(4 more...)

Industry: Information Technology > Security & Privacy (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.53)

Add feedback

Medication Management System That Uses AI To Help Doctors Treat At-Risk Patients Better

International Business TimesJun-28-2017, 13:30:21 GMT

Poor adherence is a widespread medical problem, which has poor health outcomes and inflates healthcare costs. According to the U.S. National Library of Medicine, 75 percent of Americans face trouble taking medicine as instructed by their doctors. Israeli personalized medication management platform, Medisafe, wants to change this using artificial intelligence (AI). The start-up uses AI and machine learning on its medication adherence platform. It passively collects data from patients, such as medications prescribed, health measurements and uses self-learning algorithms, which can help a patient adhere to instructed medication better.

artificial intelligence, machine learning, platform, (12 more...)

International Business Times

Country: North America > United States (0.17)

Industry:

Health & Medicine > Consumer Health (0.72)
Health & Medicine > Health Care Providers & Services (0.56)
Health & Medicine > Government Relations & Public Policy (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.59)

Add feedback